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SECTION  I 
INTRODUCTION 


The  objectives  of  this  program  are  to  design,  fabricate,  and  evaluate  a 
genera  1 -purpose ,  electronically  programmable,  binary/analog  CCD  correlator. 

The  device  features  programmabi 1  iiy  of  the  reference  signal,  the  correlator 
length,  and  weighting  coefficient  accuracy.  Off-chip  circuitry  is  minimized  by 
including  all  clock  drivers,  output-amplifiers,  logic,  and  analog  and  digital 
shift  registers,  on-chip.  This  results  in  an  integrated  filter  circuit  that 
has  the  versatility  to  be  operated  in  nine  programmable  configurations,  thereby 
giving  it  the  flexibility  to  be  used  in  a  wide  variety  of  filtering  and  correla¬ 
tion  appl ications. 

Section  I  of  this  report  describes  the  operation  of  the  programmable 
binary/analog  correlator  and  the  architecture  necessary  to  realize  a  general- 
purpose  integrated  filter.  Section  I  concludes  with  a  breakdown  of  the  princi¬ 
pal  constituent  cells  of  the  general-purpose  correlator  and  a  summary  of  the 
accomplishments  to  date.  Section  II  contains  a  technical  discussion  of  the 
circuits  designed. 

A .  3  inary/Analog  Correlator  Operation 

In  a  conventional  CCD  transversal  filter  the  weighting  coefficients  are 
implemented  by  use  of  the  well-known  split  electrode  technique  and  are  there¬ 
fore  fixed.  Although  such  filters  have  a  number  of  important  applications  in 
Air  Force  Systems,  there  are  applications  where  it  is  necessary  to  electronically 
program  the  weighting  coefficients.  Of  the  present  programmable  techniques, 
the  most  attractive  for  matched  filtering  and  correlation  tracker  applications 
is  the  binary/analog  approach. 

Electronic  programmability  is  achieved  in  the  binary/analog  approach  by 
decomposing  each  one  of  the  weighting  coefficients  into  a  binary  representat ion 
that  can  be  loaded  into  a  static  shift  register,  as  illustrated  in  Figure  1. 
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For  M-bit  accuracy  of  each  weighting  coefficient  M  parallel  CCD  binary/ana log 
correlators  are  required,  so  that  h^  is  represented  with  M-bit  accuracy  by 
M-l 

h  =  Y  hk  2_k  .  (i) 

n  L-k  n 

k=0 

The  most  significant  bit  h^  of  each  weighting  coefficient  is  loaded  into 

the  static  shift  register,  shown  as  elongated  rectangles  in  the  filter  at  the 

top  of  the  figure.  The  second  most  significant  bit  h*  of  each  coefficient  is 

^  M  - 1 

loaded  into  the  second  coefficient  store,  and  the  least  significant  bit  h 
is  loaded  into  the  coefficient  store  shown  at  the  bottom  of  the  figure.  The 
analog  signal  must  be  weighted  by  the  appropriate  factor  of  2  11  at  either 

the  input  or  output  of  each  correlator.  In  Figure  1  the  weighting  is  performed 
at  the  input  to  the  correlators,  with  the  analog  signal  applied  without  attenua¬ 
tion  to  the  top  filter  (most  significant  bit)  and  attenuated  by  a  factor  of  two 
at  the  input  of  the  second  filter  (second  most  significant  bit).  At  the  input 
to  the  bottom  filter  (least  significant  bit)  it  is  attenuated  by  a  factor  of 


The  choice  of  whether  input  or  output  signal  weighting  is  employed,  or  a 
mixture  of  both,  depends  on  the  application.  Input  signal  weighting  is  simpler 
to  implement  as  capacitor  ratio  techniques  can  be  used,  whereas  output  weighting 
requires  amplifiers  with  accurately  controlled  gain.  However,  output  signal 
weighting  is  desirable  in  some  applications  because  of  a  dynamic  range  limita¬ 
tion  associated  with  the  input  weighting  configuration.  For  general-purpose 
correlators  it  may  be  desirable  to  perform  a  mixture  of  both  input  and  output 
weighting  to  strike  a  balance  between  dynamic  range,  the  number  of  integrators 
required,  weighting  coefficient  accuracy,  and  logic  simplicity. 
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The  total  weighting  coefficient  for  any  one  of  the  M  parallel  CCD  bits 
is  determined  by  the  sum  of  the  coefficients  in  the  M  parallel  static  shift 
register  bits.  Consequently,  when  the  outputs  of  each  filter  are  summed  to¬ 
gether  as  shown  in  Figure  1,  the  result  is 


H(z)  = 


V  „(z) 

out 

V.  (z) 
in 


N 

=  e 

n=  1 


h°  z’n 

n 


+  2 


N 

£ 

n~  1 


,  l  -n 
h  z  + 
n 


,-(M-l) 


N 

£ 

n=  1 


M-1 


(2) 


N 

=  E 

n=  1 


N 

=  E 

n=  1 


h 

n 


z 


-n 


(3) 
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Programmabil ity  of  the  CCD  weighting  coefficients  is  achieved  by  using  the 
coefficient  code  in  the  static  shift  register  to  determine  the  relative  timing 
of  charge  transfer  in  the  four-phase  CCD  register.  This  is  described  in  detail 
in  Section  II. B.  In  essence,  the  coefficient  code  in  each  of  the  static  shift 
register  cells  determines  when  the  charge  in  the  corresponding  CCD  bit  transfers 
to  the  electrode  that  is  connected  to  the  DCI.  If  the  charge  is  transferred 
early  at  tj,  the  charge  packet  is  sampled  underneath  the  sense  electrode  and 
contributes  to  the  integrator  output  during  that  cycle.  If  the  charge  is  trans¬ 
ferred  late  at  t£,  after  the  integrator  sampling  period  has  finished,  the  charge 
dees  not  contribute  to  the  output.  In  the  time  interval  between  t,  and  t2  the 
output  signal  is  available  and  given  by: 


Qout(n)  = 


N 

E 

i=1 


h. 

l 


Q  .  (n  - 
sigv 


i) 


(5) 


As  described  thus  far,  the  device  can  only  handle  unipolar  signals.  For 
a  signal  of  either  sign  to  be  used  it  must  be  added  to  a  fat  zero,  which  also 
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improves  the  charge  transfer  efficiency.  The  actual  CCD  input  is  then  Q, 
+  and  the  output  from  a  single  binary/analog  filter  is  given  by: 


Q  ,i4.(n)  =  £  h.  I  (Q__  +  Q .  )  (n  -  i)  | 


out 


i=l 


1  L  FZ 


sig 


(6) 


a  (n) 

*outv  ’ 


V  h.  «FZ  (n  -  i) 


h.  Q  .  (n 
i  sig 


i) 


(7) 


The  first  term  represents  a  code-dependent  offset  and  must  be  eliminated.  The 
second  term  is  the  desired  signal.  Tc  eliminate  the  first  term,  a  parallel 
filter  must  be  added  in  which  the  input  signal  is  inverted,  i.e.,  the  input  to 


the  second  CCD  becomes  Q__  -  Q  .  . 

FZ  sig 

given  by: 


The  output  of  the  parallel  filter  is  then 


N 

Q  _(n)  =  7!  h.  Q,  (n  -  i)  -  h.  Q.  .  (n  -  i) 
out  i-1  1  i  sig 


(8) 


When  the  two  CCD  outputs  are  connected  to  a  differential  current  integrator, 
the  undesired  terms  cancel,  resulting  in  the  DCI  output  being  given  by 


N 


Q_.*(n)  =  I  2h.  0  .  (n  -  i) 


out 


i=l 


i  sig 


(9) 


The  differential  approach  for  implementing  coefficients  with  multiple-bit 
accuracy  is  illustrated  in  Figure  2,  where  the  configuration  correlates  N  analog 
samples  with  N  digital  words,  each  of  which  is  represented  by  M-bits.  In  this 
scheme  the  weighting  is  applied  to  the  analog  signals  at  the  input  of  the  CCD 
shift  registers.  The  summing  of  the  M  parallel  channels  is  accomplished  by  a 
parallel  connection  of  the  0^  and  0^  clock  buses.  Thus,  only  one  differential 
current  integrator  (DCI)  is  required. 
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The  salient  points  of  the  binary/analog  approach  to  programmable  correlators 

are: 

•  One-bit  programmable  filters  in  which  h^  can  be  1  or  0. 

•  Capability  to  expand  to  M-bit  accuracy  by  combining  M  binary/analog 
correlators  in  parallel  with  the  appropriate  gain. 

•  Capability  to  program  the  filters  to  the  desired  length  (N  =  32,  64, 

128,  etc.)  and  to  cascade  filters  to  achieve  longer  filters. 

•  Weighting  coefficient  accuracy  sufficient  to  give  eight-bit  resolution 
on  weighting  coefficients  (i.e.,  weighting  coefficient  accuracy  of  0.2%, 

4  LSB). 

•  Minimization  of  off-chip  circuitry.  All  clock  drivers,  output  amplifiers, 
static  shift  registers,  etc.,  are  on-chip. 

•  •  Capability  to  utilize  a  microprocessor  to  read  data  out  of  memory  into 
the  code  registers. 

•  Elimination  of  the  code-dependent  output  dc  level  problem.  A  code¬ 
independent  dc  level  is  essential  for  spread  spectrum  and  ECM  applica¬ 
tions. 

B .  Architecture  for  a  General-Purpose  Correlator 

A  general -purpose  architecture  for  an  electronical 1y  programmable  1024-stage 
binary/ana 1 og  correlator  is  shown  in  the  simplified  block  diagram  of  Figure  3- 
The  architecture  is  arranged  so  that  the  1024-stage  device  is  folded  into  eight 
128-stage  sections  with  the  output  of  each  section  fed  via  a  unity  gain  amplifier 
to  the  input  of  the  next  128-stage  device.  This  provides  a  convenient  point  to 
insert  an  alternative  input  signal,  together  with  the  necessary  logic  for  select¬ 
ing  the  filter  configuration  desired. 

To  achieve  multiple-bit  accuracy,  some  of  the  signal  weighting  is  performed 
at  the  input  and  some  at  the  output,  as  this  yields  the  optimum  balance  between 
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128-stage  by  1  bit 
correlator 


Flowgraph  of  1024-S 
ed  as  a  256-stage  b 


dynamic  range,  weighting  coefficient  accuracy,  the  number  of  DCIs,  and  logic 
simplicity.  The  rectangles  labeled  XI,  X-|  through  which  the  input  passes 
indicate  that  the  input  signal  is  passed  with  unity  gain  (XI)  and  with  50% 
attenuation  (X-^)  .  The  input  selection  switches  to  the  correlators  determine 
whether  the  attenuated  or  unattenuated  signal  is  applied  to  the  correlator 
input.  The  remainder  of  the  signal  weighting  is  performed  at  the  output  of  the 
correlators. 

The  correlation  outputs  of  the  correlators  (output  at  the  top  of  each 
correlator)  go  to  the  differential  current  integrators  and  summing  amplifiers 
labeled  Z.  By  summing  the  correlation  outputs  in  pairs,  the  number  of  DCI 
amplifiers  required  is  only  four.  The  DCI  outputs  are  summed  through  two  switched, 
weighted  summation  stages  that  determine  the  mode  of  operation. 

The  flexibility  of  the  architecture  chosen  to  realize  the  general-purpose 
correlator  chip  is  illustrated  by  the  nine  possible  operating  configurations 
described  below. 

Single  Mode 

1.  1024-stage  by  1  bit.  In  this  mode  the  input  is  applied  to  V 1 ° ( 1 ) , 

and  the  output  is  taken  from  V°Ut(7).  The  inputs  to  each  correlator  are  switched 
to  the  lower  position  so  that  the  serial  output  from  the  preceding  correlator 
is  applied  to  the  input.  The  switches  on  the  outputs  all  select  the  unity  gain 
ampl if iers. 

2.  512-stage  by  2  bits.  In  this  mode  of  operation  the  input  is  applied  to 
Vin(l),  and  the  output  is  taken  from  V°Ut(7).  The  XI  input  is  applied  to 
correlator  1,  and  x£  input  is  applied  to  correlator  2.  The  inputs  of  all  other 
correlators  are  switched  to  the  central  position  such  that  the  unattenuated  (XI) 
signal  passes  from  correlator  1  to  correlator  3  to  correlator  5  to  correlator  7. 
and  the  attenuated  (X^-)  signal  passes  from  correlator  2  to  correlator  4  to 
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correlator  6  to  correlator  8.  The  output  amplifiers  are  all  switched  to  the 
unity  gain  setting  and  summed  to  provide  the  output  at  V°Ut(7). 

3.  256-stage  by  4  bits.  Four-bit  and  8-bit  operation  are  the  only  modes 
that  utilize  output  amplifier  settings  at  other  than  unity  gain.  The  connections 
for  this  mode  of  operation  are  illustrated  in  Figure  3.  The  same  input  is 
applied  to  both  Vin(l)  and  Vln(3).  The  top  four  correlators  perform  a  256-stage 
by  2-bit  correlation  with  the  most  significant  bit  (MSB)  and  the  second  signifi¬ 
cant  bit  (2SB) ,  and  the  bottom  four  correlators  perform  a  256-stage  by  2-bit 
correlation  with  the  third  significant  bit  (3SB)  and  the  least  significant  bit 
(LSB).  The  outputs  V°tJt(5)  and  V°Ut(6)  are  then  summed  together  with  V°Ut(6) 
being  summed  through  the  X5  amplifier  to  provide  the  4-bit  weighted  output  at 

vout(7). 

4.  128-stage  by  8  bits.  In  this  mode  of  operation  the  input  is  applied 
simultaneously  to  Vln(l),  Vln(2),  Vin(3),  and  Vln(4).  The  top  four  correlators 
perform  a  128-stage  by  4-bit  correlation  with  the  MSB,  2SB,  3SB,  and  4SB;  the 
bottom  four  correlators  perform  a  128-stage  by  4-bit  correlation  with  the  5SB, 

6SB,  7SB,  and  LSB.  The  output  VOUt(6)  is  then  summed  through  the  XI/16  amplifier, 
together  with  V°ut(5),  to  provide  the  8-bit  correlation  at  V°ut(7). 

Dual  Mode 

5.  512-stage  by  1  bit.  In  this  mode  the  two  inputs  are  applied  to  Vln(l) 
and  Vln (3) ,  and  the  outputs  are  taken  from  V°ut(5)  and  VOUt (6) .  Operation  is 
similar  to  the  1024-stage  by  1  bit  except  that  the  switch  at  the  input  to  the 
fifth  correlator  is  in  the  upper  position  so  that  Vin(3)  is  applied  to  the  input. 
The  outputs  V°ut(5)  and  V°ut(6)  are  taken  separately  instead  of  being  summed  to 
give  V°Ut(7). 

6.  256-staqe  by  2  bits.  In  this  mode  operation  is  similar  to  the  512-stage 
by  2-bit  mode  except  that  inputs  are  made  at  Vin(l)  and  Vin(3),  and  outputs  are 
taken  at  V°ut(5)  and  V0ut(6). 
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7.  128-stage  by  4  bits.  In  this  mode  of  operation  the  top  four  correlators 

perform  one  128-stage  by  4-bit  correlation  and  the  bottom  four  perform  another. 
Input  1  is  applied  simultaneously  to  Vin(l)  and  Vln(2),  and  input  2  is  applied 
simultaneously  to  Vln(3)  and  Vin(4).  V°ut(2)  and  V°ut(4)  are  summed  through  the 

X^  amplifier,  to  V°Ut(l)  and  V°ut(3)  to  provide  the  two  outputs  at  V°Ut( 5)  and 

vout(6) . 

Quad  Mode 

8.  256-stage  by  1  bit.  In  this  mode  the  four  inputs  are  applied  to  V 1 n ( 1 ) , 
V*n(2),  Vin (3) ,  and  V*n(4);  and  the  four  outputs  are  taken  at  V°ut(1),  VOUt(2)f 
V°Ut (3) ,  and  V°Ut(4).  Correlators  1,  3,  5,  and  7  select  the  XI  input  (upper 
position);  and  correlators  2,  4,  6,  and  8  select  the  output  from  the  previous 
correlator  (lower  position). 

9.  128-stage  by  2  bits.  In  this  mode  the  pairs  of  filters  operate  sep¬ 
arately.  Inputs  are  made  at  Vln(1),  Vin(2),  Vin(3),  and  V 1 n (4) ;  outputs  are  taken 
at  V°Ut(l),  V°Ut(2),  V°Ut(3) ,  and  V°ut(4). 

C.  Constituent  Cells 

The  architecture  of  the  general-purpose  binary/analog  correlator  can  be 
broken  down  into  the  following  principal  on-chip  cells. 

•  16  x  128-bit  four-phase  CCDs 

•  16  x  64-bit  static  binary  shift  registers  for  storing  the  reference 
signal 

•  Switching  circuitry  for  selecting  the  filter  configuration  and  operating 
mode;  either  single,  dual  or  quad 

•  16  x  unity  gain  charge  amplifiers  (CVAMPs)  for  linking  the  output  of  one 
128-stage  CCD  to  the  input  of  the  next 

•  7  x  DCIs/summing  amplifiers 

•  Microprocessor  interface  circuitry  for  loading  the  reference  signal 
from  memory  into  the  static  shift  registers 

•  Clock  generation  and  drive  circuits. 
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Accomplishments  to  Date 

•  An  overall  layout  scheme  has  been  developed  (see  Figure  4)  resulting 

-  2  2 

in  a  chip  size  of  approximately  7-1  x  6.2  mm  (280  x  245  mils  ). 

•  A  bus  line  structure  has  been  devised  that  strikes  a  balance  between 
length  of  routing  (and  hence  resistance  and  capacitance)  with  the 
njmber  of  crossovers  and  crossunders. 

•  The  input,  output,  and  filter  mode  selection  circuitry  has  been  designed 
and  laid  out. 

•  The  digital  shift-register  cell  has  been  designed,  laid  out,  and 
simulated.  The  cell  has  also  been  included  on  a  TI-funded  test  bar 
and  should  be  available  for  evaluation  during  March. 

•  The  CVAMP  design  has  been  completed,  laid  out,  and  /imulated.  This 
amplifier  has  also  been  included  on  the  same  test  bar  as  the  digital 
shift-register  cell. 

•  A  clocking  scheme  has  been  devised  for  shifting  the  charge  from  the  out¬ 
put  of  one  CCD  through  the  CVAMP  to  the  input  of  the  next  CCD  during 
one  clock  cycle.  This  scheme  allows  charge  to  move  through  only  one 
CCD  bit  and  therefore  prevents  the  loss  of  any  correlation  samples, 

•  The  CCD  cells  have  been  designed  and  laid  out. 

•  Circuitry  has  been  designed  and  simulated  to  allow  the  reference  signal 
to  be  loaded  into  the  static  shift-register  under  microprocessor  control. 

•  The  differential  current  integrators  and  output-summing  amplifiers  have 
been  designed  and  simulated. 

•  The  complete  clocking  requirements  of  the  device  have  been  evaluated  and 
the  various  loading  capacitances  calculated.  The  clock  drivers  have  been 
designed  and  simulated  and  are  currently  being  laid  out.  The  clock 
generation  circuitry  has  been  designed  and  simulated. 

•  Accuracy  considerations  require  that  the  Xl  and  Xi  signal  amplifiers, 
which  are  located  ahead  of  the  correlators,  be  placed  off-chip. 
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SECTION  II 


GENERAL  PURPOSE  BINARY/ANALOG  CORRELATOR  DESIGN 


A .  Filter  Length,  Weighting  Accuracy,  and  Mode  Selection  Circuitry 

To  externally  select  anyone  of  the  nine  filter  configurations  a  four-bit 
input  word  is  required,  which  therefore  necessitates  an  on-chip  four-bit  to  nine- 
line  decode  circuit.  Table  1  shows  the  input  word  required  to  select  any  one  of 
the  nine  filter  configurations.  The  outputs  from  the  decode  circuitry  are  fed 
to  the  input  and  output  switches  S1-S10,  (see  Figure  3  for  identification  of 
switches).  A  number  of  the  lines,  however,  may  be  combined  as  some  of  the  input 
switch  positions  in  the  single,  dual,  and  quad  modes  are  identical,  while  the 
output  switches  are  in  "don't  care"  positions.  The  resulting  lines  that  can  be 
combined  through  OR  gates  are  3  with  6,  and  4  and  7  with  9.  (For  definition  of 
the  various  filter  configurations  see  Table  1.)  It  is  therefore  only  necessary 
to  route  six  control  lines  to  the  input  switches  and  five  lines  to  the  output 
switches.  Figure  5  shows  the  switching  circuitry  for  selecting  the  filter 
length  and  weighting  coefficient  accuracy. 

In  addition  to  the  switching  for  selecting  the  length  and  weighting  accuracy 
of  the  filters,  input  and  output  mode  switches  are  required  to  interconnect  the 
various  inputs  and  outputs  for  the  single,  dual,  and  quad  modes  of  operation. 

Mode  selection  is  implemented  by  using  OR  gates  in  conjunction  with  switches  across 
the  various  input  and  output  lines  [see  Figures  6(a)  and  6(b)],  The  mode 
selection  switching  results  in  four  output  and  sixteen  input  leads.  (The  sixteen 
inputs  are  due  to  the  XI  and  Xj  functions  being  off-chip;  in  Figure  3  each 
128-stage  correlator  incorporates  a  differential  pair.) 

B .  Programmable  Weighting  _^inq  a  Four-Phase  CCD 

In  the  original  proposal  for  the  binary/analog  correlator,  No.  23-R77.  the 
programmable  weighting  scheme  was  discussed  with  reference  to  a  two-phase  CCD 
structure.  Since  that  time  a  four-phase  CCD  technology  has  been  developed, 
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Table  1 


Filter  Configuration  Versus  Input  Select  Word 


ri!ter  Configuration 

Mode 

I  ipu  L 

Word 

A 

B 

C 

D 

(I) 

1024-stage  by 

-bi  t 

Single 

V 

0 

0 

0 

(2) 

51  2-stage 

by 

2 

bi  ts 

Single 

0 

1 

0 

0 

(3) 

256-stage 

by 

4 

bits 

Single 

1 

1 

0 

0 

(4) 

1  28-stage 

by 

8 

bits 

Single 

0 

0 

1 

0 

(51 

51  2-stage 

by 

1 

bi  t 

Dual 

1 

0 

1 

0 

(6) 

256-stage 

by 

2 

bi  ts 

Dua  1 

0 

1 

I 

0 

(7) 

1  28-s  tage 

by 

4 

bits 

Dual 

1 

1 

1 

0 

(8) 

256-stage 

by 

1 

bi  t 

Quad 

0 

0 

0 

1 

(9) 

1  28-stage 

by 

2 

bi  ts 

Quad 

1 

0 

0 

1 
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From  4-bit  to  9-line  decode  circuit 


Figure  5  Circuitry  for  Selecting  Filter  Length  and  Weighting 
Coefficient  Accuracy 
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I  >nput 


A1 

ON 

ON 

OFF 


Figure  6(a)  Input  Switches  for  Single,  Double,  and  Ouad  Mode  Selection 


in  Quad 
Modes  8,  9 


offering  the  advantages  of  fewer  processing  steps  and  higher  yield.  The  pro¬ 
grammable  weighting  scheme  has  been  adapted  to  suit  the  four-phase  structure, 
and  Figure  7  shows  a  section  of  such  a  CCD  with  plots  of  the  surface  potential 
at  various  times  during  charge  transfer  and  the  driving  waveforms. 

In  a  similar  way  as  was  accomplished  with  the  two-phase  structure,  pro¬ 
grammable  weighting  is  implemented  by  selectively  controlling  the  time  of 
charge  transfer  from  one  well  to  the  next.  The  four-electrode  CCD  has  two 
clocked  electrodes,  0^  and  0^,  and  a  dc-biased  barrier  electrode,  0^,  which 
serves  to  reduce  clock  feedthrough  from  02  to  the  sense  electrodes,  0^.  The 
0^  electrodes  are  in  common  on  the  output  summing  bus,  which  is  tied  to  an 
integrator  that  maintains  it  at  an  intermediate  dc  bias  level. 


At  time  t ^  the  charge  packets  are  shared  between  the  0j ,  and  02  electrodes. 

At  time  t2  the  0f  clock  is  turned  off,  leaving  the  charge  under  the  02  storage 

and  control  electrodes.  At  time  t.,  the  02  electrode  corresponding  to  q^  is 

turned  off,  thereby  transferring  to  under  the  sense  electrode  0^.  Meanwhile, 

other  charge  packets  such  as  q  +1  (depending  upon  the  code  in  the  digital 

n 

shift-register)  have  been  left  unchanged  as  those  02  electrodes  have  not  been 


clocked  to  the  off  state. 


At  time  t^  the  02  electrodes  that  were  held  on  at 


t^  are  turned  off,  resulting  in  all  the  remaining  charge  packets  being  trans¬ 
ferred  to  the  0^  electrodes.  The  weighting  on  any  one  sample  in  the  correlator 
is  therefore  determined  by  how  many  of  the  M  parallel  CCD  charge  packets  were 
transferred  to  the  0^  sampling  electrodes  at  t^.  If  all  M  parallel  bits  were 
transferred  at  t^>  the  weighting  coefficient  would  be  1;  if  none,  the  weighting 
would  be  zero.  Hence,  the  output  of  the  integrator  between  t^  and  t^  is  pro- 
portiona'  to  the  sum  of  the  charge  packets  that  were  transferred  due  to  the 
corresponding  02  electrodes  being  turned  off.  At  t^  the  remaining  02  electrodes 
are  switched  off  so  that  all  the  charge  packets  reside  under  the  0^  electrodes. 
Between  t^  and  tj  the  integrator  is  reset;  the  cycle  then  repeats. 
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As  noted  in  Section  I. A,  for  each  section  of  the  correlator  two  CCD  channels 
are  operated  in  parallel,  both  being  controlled  by  the  same  digital  shift-register. 
The  input  signal  and  its  complement  are  applied  to  the  two  CCD  channels,  while 
the  output  sense  buses  (0^)  are  differentially  summed  by  the  differential  current 
integrator  (DCI).  Thus,  the  bias  charge,  or  fat  zero,  does  not  produce  an 
output  from  the  DCI;  and  there  is  no  code  dependent  offset.  These  features  are 
inherent  in  the  device  operation. 

C.  Digital  Shift  Register 

Figure  8  shows  the  circuit  diagram  of  one  stage  of  the  static  digital  shift 

register,  together  with  the  controlling  and  output  waveforms.  The  shift  register 

holds  the  reference  signal/code  that  determines  the  time  of  charge  transfer  from 

02  to  0^  and  is  operated  by  three  nonoverlapping  clocks,  0^,  0g,  and  0^.  To 

control  whether  the  shift-register  is  connected  serially  for  loading  or  whether 

each  bit  is  continuously  recirculated  within  the  same  stage  in  the  storage  mode, 

0  is  routed  to  the  gate  of  either  Ml  or  m8.  The  circuitry  for  controlling  the 
B 

routing  of  0  allows  microprocessor  loading  of  the  shift  register  and  is  discussed 
B 

in  the  following  section.  If  a  digital  1  is  loaded  into  the  shift  register 
stage,  a  long  pulse,  02^’  resu^ts>  50  that  no  sampling  occurs  under  the  0^ 
electrode  in  the  corresponding  CCD  cell.  If  the  bit  is  a  0,  a  short  pulse, 

02£,  occurs,  with  the  result  that  the  charge  packet  is  sampled. 

Operation  of  the  digital  shift  register  can  be  explained  by  letting  the 

load/hold  control  circuitry  route  the  0  pulse  to  the  gate  of  Ml  and  letting 

B 

the  load  input  go  high. 

When  <0D  goes  on,  the  input  turns  M2  on,  so  that  when  0D  turns  off,  the  source 

O  D 

of  M3  discharges  to  ground  through  M2.  The  next  event  is  when  0^.  goes  on,  this 
sets  the  output  of  the  shift  register  to  zero  and  hence,  the  corresponding  CCD 
02  electrode.  When  0^  goes  on,  the  gate  of  M5  is  connected  to  the  drain  of  M2 
through  m4  and  is  therefore  set  to  a  "0."  At  the  same  time  the  output  of  the 
shift  register  tracks  0^  through  m6.  When  0^  goes  off,  the  shift  register  out¬ 
put  is  left  floating  high  because  M5  is  turned  off.  Consequently,  a  long  02 
pulse  is  generated;  only  when  0^  comes  on  will  the  output  be  reset  to  zero.  Now 
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let  the  control  circuitry  route  the  next  0^  pulse  to  the  gate  of  the  hold  tran¬ 
sistor  M8.  When  M8  is  clocked,  the  high  output  of  the  shift  register  is  routed 
to  the  gate  of  M2;  hence,  the  previous  cycle  is  repeated,  resulting  in  the  genera¬ 
tion  of  another  long  02  pulse. 

If  the  input  via  Ml  had  been  a  "0"  rather  than  a  "I,"  then  when  0  went 

D 

off  the  drain  of  M2  would  have  been  left  floating  high.  Hence,  when  0^  went 
on,  the  gate  of  M5  would  have  been  set  to  a  "l ,"  resulting  in  the  source  of  M6 
being  discharged  through  M5  when  0^  turned  off.  This  would  have  caused  a  short 
0 2  pulse  to  be  generated. 

D.  Microprocessor  Interface  Circuitry 

To  permit  microprocessor  loading  of  the  digital  shift  register,  load/hold 
circuitry  must  route  the  0  clock  to  the  gates  of  the  load  transistors  at  times 
determined  by  the  presence  of  an  asynchronous  strobe  pulse  generated  by  the 
microprocessor.  The  presence  of  the  strobe  pulse  indicates  that  the  micro¬ 
processor  is  ready  to  send  new  data;  this  data  must  be  accepted  by  the  digital 
shift  registers  during  the  next  cycle.  If  the  strobe  pulse  does  not  occur 
during  any  one  cycle,  the  0  clock  must  be  routed  to  the  gate  of  the  hold 
transistor,  so  that  the  old  data  can  be  reci rculated.  The  circuit  of  Figure  9 
fulfills  these  requirements.  ( 

The  circuit  comprises  an  RS  flip-flop,  whose  output  is  set  whenever  a 
strobe  pulse  is  received.  The  output  of  the  flip-flop  turns  MC  on  when  0 
clocks  MA;  MB  is  held  off  by  the  action  of  the  inverter  II.  Consequently,  the 
gate  of  M8,  the  recirculate  transistor,  is  held  low.  When  0^  clocks  MD,  MF 
is  held  off  and  ME  is  turned  on  by  the  action  of  12,  which  also  resets  the 
flip-flop.  Hence,  when  0g  goes  high  it  is  routed  to  the  gate  of  Ml,  the  load 
transistor. 

If  there  is  a  cycle  when  no  strobe  pulse  is  received,  then  the  flip-flop 
output  remains  low,  and  consequently  when  0^  goes  high,  MC  stays  off,  and  MB 
is  turned  on  by  II.  When  0ft  goes  high,  MF  is  turned  on  and  ME,  by  the  action  of 
12,  is  turned  off.  Consequently  when  0g  goes  high,  it  is  routed  through  MB  to 
the  gate  of  M8,  the  hold  transistor. 
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Figure  9  microprocessor  Interface  Circuitry  Showing  Generation  of  Load 
and  Hoi d  Pu 1 ses 


To  provide  the  microprocessor  with  a  data  acknowledge  signal,  the  Q,  output 
of  the  RS  flip-flop  is  taken  off-chip.  This  signal  enables  the  microprocessor, 
so  that  new  data  can  be  presented  to  the  static  shift  registers. 

E .  Charge  Transfer  Between  CCD  Registers 

The  architecture  of  the  general-purpose  correlator  requires  the  output  of 
each  128-stage  CCD  register  to  be  folded  back  along  its  length  to  the  input  of  the 
next  stage.  This  constraint  requires  the  output  charge  of  each  CCD  register  to 
be  converted  to  a  voltage  by  a  precision  unity  gain  charge  amplifier.  The 
amplifier  output  is  fed  to  the  input  of  the  following  CCD  where  a  voltage  to 
charge  conversion  is  performed.  This  conversion  must  result  in  the  input  charge 
packet  being  identical  in  size  to  the  charge  packet  transferred  to  the  output 
of  the  preceding  CCD.  An  additional  constraint  is  that  the  transfer  of  charge 
between  CCDs  must  take  place  during  one  clock  period  without  the  loss  of  a 
correlation  sample.  The  scheme  described  in  the  following  sections  meets  these 
requirements. 

1 .  Clocking  Scheme 

Figures  10(a)  and  10(b)  show  the  output  and  input  sections  of  the 
CCDs  with  plots  of  surface  potential  at  various  times  during  the  charge 
transfer  cycle.  Figure  10(c)  shows  the  waveforms  required  to  control  the 
transfer  of  charge  between  the  CCD  registers. 

At  time  t^  all  the  clocks  are  off,  and  the  charge  packets  are  located 
under  the  sampling  electrodes  0^.  At  time  t^  0j,  0  ,  and  0'  are  turned  on; 
hence,  charge  is  transferred  to  0^  and  0 ^ >  or  at  the  output  0^  and  0^.  0^  is 
turned  off  at  t^,  thereby  transferring  all  the  charge  to  0 ^  or  0^-  At  t^  the 
input  gate  of  the  next  CCD  is  turned  on  and  at  t^,  0'  turned  off.  This  transfers 
the  charge  packet  n  to  the  sampling  electrode  0^A  that  is  connected  to  the  unity 
gain  amplifier.  The  amplifier  connects  the  charge  under  0^A  to  a  voltage,  which 
sets  the  input  diode  potential  of  the  following  CCD  and  therefore  the  amount  of 
charge  that  is  stored  under  0".  0"  is  driven  by  the  same  clock  which  drives  all 

the  other  0^  electrodes,  but  has  independent  amplitude  control,  thereby  allow¬ 
ing  the  fat  zero  reference  level  to  be  "fine  tuned." 
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Figure  10(a)  CCD  Output  Stage  Showing  Surface  Potential  Variations 
During  Charge  Transfer  Between  CCD  Registers 


27 


At  tg  the  input  gate  is  turned  off  and  hence  the  CVAMP  has  between 
t  and  tg  to  settle,  which  at  a  clock  rate  of  1  MHz  is  approximately  370  ns. 

0 1  is  turned  on  at  t^  so  that  the  charge  packet  that  was  located  under  0^  is 
dumped  into  the  reverse-biased  diode  at  the  end  of  the  CCS  register.  At  the 
same  time  the  0^  clocks,  including  0^,  are  selectively  turned  off,  depending 
on  the  reference  code  in  the  digital  shift  register,  and  so  sampling  under  the 
first  0^  electrode  in  each  CCD  is  performed  as  normal.  Figure  10(b)  shows 
the  charge  packet  not  being  sampled  under  the  first  0^  electrode  and  the 
succeeding  packet  not  being  sampled.  After  0'  is  turned  off,  the  CVAMP,  and 
hence  the  input  diode  potential,  is  reset. 

The  clocking  scheme  therefore  results  in  each  128-stage  CCD  having  the 
cell  at  each  end  of  the  register  split  between  itself  and  the  adjacent  CCDs  in 
the  serial  chain.  Furthermore  the  transfer  scheme  is  inverting,  which  requires 
the  outputs  from  the  differential  CCD  channels  to  be  interchanged  when  they  are 
connected  to  the  inputs  of  the  succeeding  CCD  pair. 

2.  Charqe-to-Vol taqe  Amplifier 

A  key  part  of  the  circuitry  for  successful  transfer  of  charge  between 
CCD  shift-registers  is  a  precision  unity  gain  charge  amplifier.  As  the  general- 
purpose  correlator  requires  16  such  amplifiers,  primary  requirements  are  low- 
power  consumption  and  smal 1 -si  1  icon  area.  In  addition,  for  operation  at  1  MHz 
with  a  3  V  signal  swing,  the  amplifier  must  settle  to  1  %  in  ~  350  ns.  High 
open  loop  gain  is  not  required,  as  any  inaccuracy  in  the  closed  loop  unity 
gain  mode  can  be  compensated  by  adjustment  of  the  feedback  capacitor  Cp. 

An  NMOS  amplifier  circuit  that  fulfills  the  above  requirements  is  shown 
with  its  characteristics  in  Figure  11.  The  amplifier  is  composed  of  a  single 
differential  gain  stage  and  a  differentia]  to  single-ended  conversion  stage.  The 
feedback  path  between  Ml  and  m8  also  provides  for  improved  common  mode  rejection. 
The  output  is  taken  via  a  source-follower  stage  that  is  designed  to  drive  a  load 
capacitance  of  up  to  5  pF.  Open  loop  unity  gain  frequency  compensation  is  pro¬ 
vided  by  C  ,  which  also  limits  the  amplifier  slew  rate.  The  amplifier  occupies 

c  2 

an  area  of  0.5  mm  that  includes  both  reset  transistors  and  the  closed  loop 
gain  adjustment  capacitor  Cp 


1 
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F.  Differential  Current  Integrator  Design 

The  DC  I  amplifier  is  shown  in  Figure  12  with  the  related  circuit  elements. 
The  10.7  pF  feedback  capacitor  allows  a  full  8  V  output  swing  when  a  filter 
processing  gain  of  0.25  is  used  and  maximum  signal  is  present  in  the  CCD.  The 
Phase  1  clock  feedthrough  is  cancelled  by  a  complementary  clock  and  cancellation 
capacitance.  Fat  zero  signal  must  be  eliminated  by  the  common  mode  rejection  of 
the  amplifier  (more  than  60  dB  are  required).  An  open  loop  gain  of  greater 
than  1000  is  required  for  gain  stability.  Be  ause  of  the  parasitic  capacitances, 
the  amplifier  must  be  phase-compensated  for  a  closed  loop  gain  of  6.7.  Operation 
at  a  1  MHz  data  rate  requires  that  the  DCI  settle  in  —  350  ns. 

A  schematic  of  the  DCI  amplifier  is  given  in  Figure  13.  The  bias  string 
Ml4  to  M 1 8  is  matched  to  the  level  shifters  M5  to  m8  and  MlO  to  M 1 3 ,  so  that 
the  quiescent  operating  point  is  well  controlled.  The  input  differential  pair 
Ml  -  M2  operate  at  a  current  of  65  p,A,  each  giving  a  gain  of  26  in  the  first 
stage.  The  voltage  at  node  10  is  fed  back  to  the  tail  current  source  M9  by  the 
source  follower  M5  to  provide  increased  common  mode  rejection  and  perform  a 
differential  to  single-ended  conversion  in  the  first  stage.  The  source  follower 
level  shifter  MlO  to  M 1 3  provides  the  correct  dc  voltage  to  the  second  stage. 
Phase  shift  through  this  level  shifter  is  minimized  by  the  high  frequency  feed 
forward  capacitor,  CLS0. 

The  second  stage  gain  of  43  is  realized  with  an  enhance  gain  stage  M 1 9  to 
M22.  About  130  p,A  flow  through  both  the  cascode  transistor  M20  and  the  current 
source  M21.  M27  buffers  the  second  stage  from  both  the  output  buffer  load  and 

a  portion  of  the  low  frequency  compensating  capacitor,  CCF,  to  increase  the 
bandwidth  of  the  second  stage. 

The  output  is  a  source  follower  operating  at  300  p,A  of  current  in  order  to 
drive  a  load  of  15  pF.  Additional  current  for  pull  down  is  provided  by  the  diode 
MLS,  which  also  controls  the  operating  point  of  the  second  stage  during  negative 
output  slewing. 
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Closed  Loop  Gain 
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Figure  12  DCI  Circuit  with  Parasitics 
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The  open  loop  transfer  function  of  the  DCI  amplifier  is  given  in  the  SPICE 
simulation  output  of  Figure  14.  The  unity  gain  crossover  frequency  is  31  MHz, 
and  the  zero  phase  margin  frequency  is  18  MHz.  A  transient  simulation  for 
+/-  4  V  output  steps  is  illustrated  in  Figure  15.  All  steps  settle  to  eight 
bits  in  less  than  300  ns.  The  performance  of  the  amplifier  is  summarized  in 
Table  2. 

G.  Correlator  Clocking 

Figure  16  shows  the  relative  timing  of  the  ten  clocks  required  to  operate 

the  binary/analog  correlator.  0^,  0^,  and  Q  are  used  to  control  the  operation 

of  the  digital  shift  register,  which  in  turn  controls  the  0 ^  CCD  clock,  as 

discussed  in  Section  II. C.  0',  0';  anc*  IPG  control  the  transfer  of  charge 

between  CCD  registers  (see  Section  II. E).  0^  is  the  remaining  CCD  clock,  and 

0  -  are  the  DCI  reset  clocks;  0  and  0  are  also  used  to  reset  the  CVAMPs.  The 
R  2  B  C 

clock  timing  has  been  designed  to  allow  the  maximum  possible  settling  time  for 
the  DCIs  and  charge  amplifiers.  At  a  clock  rate  of  I  MHz  these  times  are  ~  375  ns. 

The  clock  driver  circuit  is  shown  in  Figure  17-  The  six  devices  driven  by 
the  input  lines  act  as  buffers  between  the  large  gates  of  the  output  devices  and 
the  input  signal  drivers.  They  also  provide  an  appropriate  delay  so  that  the 
bottom  of  the  bootstrap  capacitor  MB  is  held  at  ground  as  an  ON  signal  is  applied. 

After  a  short  delay,  M2  is  turned  off.  The  source  of  Ml  is  bootstrapped  to 
~  1.5  Vpg,  ar|d  the  load  voltage  can  rise  to  within  a  threshold  of  V^.  The  device 
aspect  ratios  are  determined  by  the  required  risetime  and  load  capacitance.  From 
the  layout  of  the  CCD  and  digital  shift  register  cells,  the  various  clock  loading 
capacitances  have  been  calculated  and  are  given  in  Table  3-  The  high  load  capacitances 
result  in  the  drive  circuits  occupying  a  considerable  portion  of  the  bar  area, 

~  20%.  The  area  of  the  drive  circuits,  designed  to  drive  the  various  load 
capacitances  with  a  20  ns  rise  time,  are  also  included  in  Table  3. 
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Figure  15  DC  I  Step  Response  for  +/-  4  V  Output  Swing 
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Table  2 


PCI  Amplifier  Performance 


Operating  Voltages: 

Currents: 
Power  Pis: 


VDD  =  15  V,  GND  =  0  V,  VBB 
(FDD/GND)  =  1 .07  MA 
16.1  MW 


-5  V 


Open  Loop  Gain:  1058 

Output  Resistance:  544  Ohms 

Closed  Loop  (15  Pf  Load)  Gain  =  6.7 
Bandwidth  =  4.7  MHz 
Phase  Margin  =  78  DEG 
Gain  Margin  =  2.73 
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CVAMP  Sense  Time 


Table  3 


Loading  Capacitances  and  Corresponding  Clock  Driver  Areas 
for  20  ns  Rise  Time 

Clock  Phase  Capacitive  Loading  Driver  Area,  mi  1 s 


*1 

425  PF 

400 

*2 

10  pF 

57 

IPG 

7  PF 

57 

0.' 

7  PF 

57 

0A 

720  pF 

840 

0B 

140  pF 

213 

0c 

60  pF 

127 

0R2 

10  pF 

57 

*R1 

10  pF 

57 

10  pF 

57 
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The  maximum  frequency  of  device  operation  is  primarily  determined  by  the 
power  dissipation  of  the  drivers.  The  frequency  limit  turns  out  to  be  about  1  MHz, 
as  with  80%  efficient  drivers  and  a  15  V  clock  swing,  the  power  dissipation  is 
~  400  mW.  To  estimate  the  total  chip  dissipation,  the  driver  dissipation  must  be 
added  to  that  of  the  circuitry  for  generating  the  clock  waveforms,  ~  150  mW;  the 
seven  DCJs^  ***  140  mW;  and  the  16  charge  amplifiers,  ~  100  mW:  a  total  of  ~  800  mW. 
In  addition  to  the  power  dissipation  restricting  the  maximum  frequency  of  operation 
to  ~  1  MHz,  neither  the  DCIs  nor  the  CVAMPs  will  settle  quickly  enough  to  permit 
operation  at  much  beyond  this  frequency.  j 
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